Improving Quality of Vietnamese Text Summarization Based on Sentence Compression
نویسندگان
چکیده
Sentence compression is a valuable task in the framework of text summarization. In previous works, the sentence is reduced by removing redundant words or phrases from original sentence and tries to remain information. In this paper, we propose a new method that used Grid Model and dynamic programming to calculate n-grams for generating the best sentence compression. These reduced sentences are combined to text summarization. The experimental results showed that our method really effective and the text is grammatically, coherence and concise. Keywords—Sentence compression; topic modeling; text summarization; Grid model; n-grams; dynamic programming
منابع مشابه
A Primary Study on Summarization of Documents in Vietnamese
ION There are some statistical-based sentence extraction methods applied to English documents to get the automatically summaries. In this paper, we present a Vietnamese text summarization case-study based on evaluation and extraction of highly informative sentences to abstract documents, assisting users in reducing the time required to study and grasp information in Vietnamese, particularly app...
متن کاملUsing Coreference Links and Sentence Compression in Graph-based Summarization
Recent years have shown that graphs are an adequate text representation model for summarization. For this years’ TAC update summarization challenge, we extended our graph-based summarization system with coreference relations and sentence compression. Our results show that using coreference relations did not result in a significant performance gain; sentence compression had a negative effect on ...
متن کاملA Feature Terms based Method for Improving Text Summarization with Supervised POS Tagging
Text summarization is the process of distilling the most important information from a source to produce an abridged version for a particular user and task. When this is done by means of a computer, i.e. automatically, it calls as Automatic Text Summarization. Summarization can be classified into two approaches: extraction and abstraction. Extraction based summaries are produced by concatenating...
متن کاملImproving Multi-documents Summarization by Sentence Compression based on Expanded Constituent Parse Trees
In this paper, we focus on the problem of using sentence compression techniques to improve multi-document summarization. We propose an innovative sentence compression method by considering every node in the constituent parse tree and deciding its status – remove or retain. Integer liner programming with discriminative training is used to solve the problem. Under this model, we incorporate vario...
متن کاملFrequent Term based Text Summarization for Bahasa Indonesia
Text summary helps in understanding the content of a text without having to read the contents of the text as a whole. Automatic text summarization can be used to summarize the text easier. In this paper a frequent term based text summarization for Bahasa Indonesia is designed and implemented in java. The proposed system generates a summary for a given input document based on identification and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016